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3 <110> APPLICANT: YAMAKI , TOSHIFUMI 

4 BANBA, SHINICHI 

5 MATOISHI, KAORI 

6 I TO, KYOSHI 

7 KOBAYASHI, HIDEKI 

8 TANAKA, EI SHI 

9 OIKAWA, TOSHIHIRO 

11 <120> TITLE OF INVENTION: NOVEL NITRILE HYDRATASE 

13 <130> FILE REFERENCE: 018765-218 

15 <140> CURRENT APPLICATION NUMBER: 10/539,560 



C--> 16 <141> CURRENT FILING DATE: 2005-06-17 

18 <150> PRIOR APPLICATION NUMBER: PCT/ JP03 /016014 

19 <151> PRIOR FILING DATE: 2003-12-15 




21 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


JP : 


2003 


-379280 










22 


<151> 


PRIOR 


FILING DATE 


: 2003-11-10 
















24 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


jp : 


2002- 


-368360 










25 


<151> 


PRIOR 


FILING DATE 


: 2002-12-19 
















27 


<160> 


NUMBER OF 


SEQ 


ID NOS: 


142 


















29 


<170> 


SOFTWARE : 


Patent In Ver. 3 


.3 
















31 


<210> 


SEQ ID NO 


: 1 
























32 


<211> 


LENGTH: 205 
























33 


<212> 


TYPE: 


PRT 


























34 


<213> 


ORGANISM: 


Pseudonocardia i 


thermophila 












36 


<400> 


SEQUENCE : 


1 
























37 


Met 


Thr 


Glu 


Asn 


He 


Leu 


Arg 


Lys 


Ser 


Asp 


Glu 


Glu 


He 


Gin 


Lys 


Glu 


38 


1 










5 










10 










15 




40 


lie 


Thr 


Ala 


Arg 


Val 


Lys 


Ala 


Leu 


Glu 


Ser 


Met 


Leu 


He 


Glu Gin Gly 


41 










20 










25 










30 






43 


He 


Leu 


Thr 


Thr 


Ser 


Met 


He 


Asp 


Arg 


Met 


Ala 


Glu 


He 


Tyr 


Glu 


Asn 


44 








35 










40 










45 








46 


Glu 


Val 


Gly 


Pro 


His 


Leu 


Gly 


Ala 


Lys 


Val 


Val 


Val 


Lys 


Ala 


Trp 


Thr 


47 




50 










55 










60 










49 


Asp 


Pro 


Glu 


Phe 


Lys 


Lys 


Arg 


Leu 


Leu 


Ala 


Asp 


Gly Thr 


Glu 


Ala 


Cys 


50 


65 












70 










75 










80 


52 


Lys 


Glu 


Leu 


Gly 


He 


Gly 


Gly 


Leu 


Gin 


Gly 


Glu 


Asp Met 


Met 


Trp 


Val 


53 












85 










90 










95 




55 


Glu 


Asn 


Thr 


Asp 


Glu 


Val 


His 


His 


Val 


Val 


Val 


Cys 


Thr 


Leu 


Cys 


Ser 


56 










100 










105 










110 






58 


Cys 


Tyr 


Pro 


Trp 


Pro 


Val 


Leu 


Gly 


Leu 


Pro 


Pro 


Asn 


Trp 


Phe 


Lys 


Glu 


59 








115 










120 










125 








61 


Pro 


Gin 


Tyr 


Arg 


Ser 


Arg 


Val 


Val 


Arg 


Glu 


Pro 


Arg 


Gin 


Leu 


Leu 


Lys 


62 




130 










135 










140 










64 


Glu 


Glu 


Phe 


Gly 


Phe 


Glu 


Val 


Pro 


Pro 


Ser 


Lys 


Glu 


He 


Lys 


Val 


Trp 



file://C:\CRF4\OUTHOLD\VsrJ539560.htm 



1/31/2006 



Page 2 of 8 



RAW SEQUENCE LISTING DATE: 01/31/2006 

PATENT APPLICATION: US/10/539,560 TIME: 16:14:19 

Input Set : A:\18765218.APP 

Output Set: N:\CRF4\01312006\J539560.raw 

65 145 150 155 160 

67 Asp Ser Ser Ser Glu Met Arg Phe Val Val Leu Pro Gin Arg Pro Ala 

68 165 170 175 

70 Gly Thr Asp Gly Trp Ser Glu Glu Glu Leu Ala Thr Leu Val Thr Arg 

71 180 185 190 

73 Glu Ser Met lie Gly Val Glu Pro Ala Lys Ala Val Ala 

74 195 200 205 

77 <210> SEQ ID NO: 2 

78 <211> LENGTH: 233 

79 <212> TYPE: PRT 

80 <213> ORGANISM: Pseudonocardia thermophila 
82 <400> SEQUENCE: 2 



83 


Met 


Asn 


Gly 


Val 


Tyr 


Asp 


Val 


Gly 


Gly 


Thr Asp 


Gly Leu 


Gly 


Pro He 


84 


1 








5 










10 






15 


86 


Asn 


Arg 


Pro 


Ala 


Asp 


Glu 


Pro 


Val 


Phe 


Arg Ala 


Glu Trp 


Glu 


Lys Val 


87 








20 










25 






30 




89 


Ala 


Phe 


Ala 


Met 


Phe 


Pro 


Ala 


Thr 


Phe 


Arg Ala 


Gly Phe 


Met 


Gly Leu 


90 






35 










40 






45 






92 


Asp 


Glu 


Phe 


Arg 


Phe 


Gly 


He 


Glu 


Gin 


Met Asn 


Pro Ala 


Glu 


Tyr Leu 


93 




50 










55 








60 






95 


Glu 


Ser 


Pro 


Tyr 


Tyr 


Trp 


His 


Trp 


He 


Arg Thr 


Tyr He 


His 


His Gly 


96 


65 










70 








75 






80 


98 


Val 


Arg 


Thr 


Gly 


Lys 


He 


Asp 


Leu 


Glu 


Glu Leu 


Glu Arg 


Arg 


Thr Gin 


99 










85 










90 






95 



101 


Tyr 


Tyr 


Arg 


Glu 


Asn 


Pro 


Asp 


Ala 


Pro 


Leu 


Pro 


Glu 


His 


Glu 


Gin 


Lys 


102 








100 










105 










110 






104 


Pro 


Glu 


Leu 


He 


Glu 


Phe 


Val 


Asn 


Gin 


Ala 


Val 


Tyr 


Gly 


Gly 


Leu 


Pro 


105 






115 










120 










125 








107 


Ala 


Ser 


Arg 


Glu 


Val 


Asp 


Arg 


Pro 


Pro 


Lys 


Phe 


Lys 


Glu 


Gly 


Asp 


Val 


108 




130 










135 










140 










110 


Val 


Arg 


Phe 


Ser 


Thr 


Ala 


Ser 


Pro 


Lys 


Gly 


His 


Ala 


Arg 


Arg 


Ala 


Arg 


111 


145 










150 










155 










160 


113 


Tyr 


Val 


Arg 


Gly 


Lys 


Thr 


Gly 


Thr 


Val 


Val 


Lys 


His 


His 


Gly 


Ala 


Tyr 


114 










165 










170 










175 




116 


He 


Tyr 


Pro 


Asp 


Thr 


Ala 


Gly 


Asn 


Gly 


Leu 


Gly 


Glu 


Cys 


Pro 


Glu 


His 


117 








180 










185 










190 






119 


Leu 


Tyr 


Thr 


Val 


Arg 


Phe 


Thr 


Ala 


Gin 


Glu 


Leu 


Trp 


Gly 


Pro 


Glu 


Gly 


120 






195 










200 










205 








122 


Asp 


Pro 


Asn 


Ser 


Ser 


Val 


Tyr 


Tyr 


Asp 


Cys 


Trp 


Glu 


Pro 


Tyr 


He 


Glu 


123 




210 










215 










220 










125 


Leu 


Val 


Asp 


Thr 


Lys 


Ala 


Ala 


Ala 


Ala 

















126 225 230 

129 <210> SEQ ID NO: 3 

130 <211> LENGTH: 618 

131 <212> TYPE: DNA 

132 <213> ORGANISM: Pseudonocardia thermophila 

134 <400> SEQUENCE: 3 

135 atgaccgaga acatcctgcg caagtcggac gaggagatcc agaaggagat cacggcgcgg 60 

136 gtcaaggccc tggagtcgat gctcatcgaa cagggcatcc tcaccacgtc gatgatcgac 120 
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137 cggatggccg agatctacga gaacgaggtc ggcccgcacc tcggcgcgaa ggtcgtcgtg 180 

138 aaggcctgga ccgacccgga gttcaagaag cgtctgctcg ccgacggcac cgaggcctgc 240 

139 aaggagctcg gcatcggcgg cctgcagggc gaggacatga tgtgggtgga gaacaccgac 300 

140 gaggtccacc acgtcgtcgt gtgcacgctc tgctcctgct acccgtggcc ggtgctgggg 360 

141 ctgccgccga actggttcaa ggagccgcag taccgctccc gcgtggtgcg tgagccccgg 420 

142 cagctgctca aggaggagtt cggcttcgag gtcccgccga gcaaggagat caaggtctgg 480 

143 gactccagct ccgagatgcg cttcgtcgtc ctcccgcagc gccccgcggg caccgacggg 540 

144 tggagcgagg aggagctcgc caccctcgtc acccgcgagt cgatgatcgg cgtcgaaccg 6 00 

145 gcgaaggcgg tcgcgtga 618 

148 <210> SEQ ID NO: 4 

149 <211> LENGTH: 702 

150 <212> TYPE: DNA 

151 <213> ORGANISM: Pseudonocardia thermophila 

153 <400> SEQUENCE: 4 

154 atgaacggcg tgtacgacgt cggcggcacc gatgggctgg gcccgatcaa ccggcccgcg 60 

155 gacgaaccgg tcttccgcgc cgagtgggag aaggtcgcgt tcgcgatgtt cccggcgacg 120 

156 ttccgggccg gcttcatggg cctggacgag ttccggttcg gcatcgagca gatgaacccg 180 

157 gccgagtacc tcgagtcgcc gtactactgg cactggatcc gcacctacat ccaccacggc 240 

158 gtccgcaccg gcaagatcga tctcgaggag ctggagcgcc gcacgcagta ctaccgggag 300 

159 aaccccgacg ccccgctgcc cgagcacgag cagaagccgg agttgatcga gttcgtcaac 360 

160 caggccgtct acggcgggct gcccgcaagc cgggaggtcg accgaccgcc caagttcaag 420 

161 gagggcgacg tggtgcggtt ctccaccgcg agcccgaagg gccacgcccg gcgcgcgcgg 480 

162 tacgtgcgcg gcaagaccgg gacggtggtc aagcaccacg gcgcgtacat ctacccggac 540 

163 accgccggca acggcctggg cgagtgcccc gagcacctct acaccgtccg cttcacggcc 600 

164 caggagctgt gggggccgga aggggacccg aactccagcg tctactacga ctgctgggag 660 

165 ccctacatcg agctcgtcga cacgaaggcg gccgcggcat ga 702 

168 <210> SEQ ID NO: 5 

169 <211> LENGTH: 144 

170 <212> TYPE: PRT 

171 <213> ORGANISM: Pseudonocardia thermophila 



173 


<400> SEQUENCE: 


5 




















174 


Met 


Ser 


Ala 


Glu 


Ala 


Lys 


Val 


Arg 


Leu 


Lys His 


Cys 


Pro Thr 


Ala 


Glu 


175 


1 








5 










10 






15 




177 


Asp 


Arg 


Ala 


Ala 


Ala 


Asp 


Ala 


Leu 


Leu 


Ala Gin 


Leu 


Pro Gly Gly Asp 


178 








20 










25 






30 






180 


Arg 


Ala 


Leu 


Asp 


Arg 


Gly 


Phe 


Asp 


Glu 


Pro Trp 


Gin 


Leu Arg 


Ala 


Phe 


181 






35 










40 








45 






183 


Ala 


Leu 


Ala 


Val 


Ala 


Ala 


Cys 


Arg 


Ala 


Gly Arg 


Phe 


Glu Trp 


Lys 


Gin 


184 




50 










55 








60 








186 


Leu 


Gin 


Gin 


Ala 


Leu 


He 


Ser 


Ser 


He 


Gly Glu 


Trp Glu Arg 


Thr 


His 


187 


65 










70 








75 








80 


189 


Asp 


Leu 


Asp 


Asp 


Pro 


Ser 


Trp 


Ser 


Tyr 


Tyr Glu 


His 


Phe Val 


Ala 


Ala 


190 










85 










90 






95 




192 


Leu 


Glu 


Ser 


Val 


Leu 


Gly 


Glu 


Glu Gly 


He Val 


Glu 


Pro Glu 


Ala 


Leu 


193 








100 










105 






110 






195 


Asp 


Glu 


Arg 


Thr 


Ala 


Glu 


Val 


Leu 


Ala 


Asn Pro 


Pro 


Ash Lys Asp 


His 


196 






115 










120 








125 






198 


His 


Gly 


Pro 


His 


Leu 


Glu 


Pro 


Val 


Ala 


Val His 


Pro 


Ala Val 


Arg 


Ser 


199 




130 










135 








140 
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205 <210> SEQ ID NO: 6 

206 <211> LENGTH: 435 

207 <212> TYPE: DNA 

208 <213> ORGANISM: Pseudonocardia thermophila 

210 <400> SEQUENCE: 6 

211 gtgagcgccg aggcgaaggt ccgcctgaag cactgcccca cggccgagga ccgggcggcg 60 

212 gccgacgcgc tgctcgcgca gctgcccggc ggcgaccgcg cgctcgaccg cggcttcgac 120 

213 gagccgtggc agctgcgggc gttcgcgctg gcggtcgcgg cgtgcagggc gggccggttc 180 

214 gagtggaagc agctgcagca ggcgctgatc tcctcgatcg gggagtggga gcgcacccac 240 

215 gatctcgacg atccgagctg gtcctactac gagcacttcg tcgccgcgct ggaatccgtg 3 00 

216 ctcggcgagg aagggatcgt cgagccggag gcgctggacg agcgcaccgc ggaggtcttg 360 

217 gccaacccgc cgaacaagga tcaccatgga ccgcatctgg agcccgtcgc ggtccacccg 420 

218 gccgtgcggt cctga 435 

221 <210> SEQ ID NO: 7 

222 <211> LENGTH: 18 

223 <212> TYPE: DNA 

224 <213> ORGANISM: Artificial Sequence 

226 <220> FEATURE: 

227 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

228 primer 

230 <400> SEQUENCE: 7 

231 aacatcatgc gcaagtcg 18 

234 <210> SEQ ID NO: 8 

235 <211> LENGTH: 17 

236 <212> TYPE: DNA 

237 <213> ORGANISM: Artificial Sequence 

239 <220> FEATURE: 

240 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

241 primer 

243 <400> SEQUENCE: 8 

244 gttttcccag tcacgac 17 

247 <210> SEQ ID NO: 9 

248 <211> LENGTH: 20 

249 <212> TYPE: DNA 

250 <213> ORGANISM: Artificial Sequence 

252 <220> FEATURE: 

253 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

254 primer 

256 <400> SEQUENCE: 9 

257 ggccagtgcc tagcttacat 20 

260 <210> SEQ ID NO: 10 

261 <211> LENGTH: 17 

262 <212> TYPE: DNA 

263 <213> ORGANISM: Artificial Sequence 

265 <220> FEATURE: 

266 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

267 primer 

269 <400> SEQUENCE: 10 

270 caggaaacag ctatgac 17 
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273 


<210> SEQ ID NO: 11 






274 


<211> LENGTH: 18 






275 


<212> TYPE: DNA 






276 


<213> ORGANISM: Artificial Sequence 






278 


<220> FEATURE: 




• 


279 


<223> OTHER INFORMATION: Description of Artificial 


Sequence : 


Synthetic 


280 


primer 






282 


<400> SEQUENCE: 11 






283 


aacatcacgc gcaagtcg 




18 


286 


<210> SEQ ID NO: 12 






287 


<211> LENGTH: 18 






288 


<212> TYPE: DNA 






289 


<213> ORGANISM: Artificial Sequence 






291 


<220> FEATURE: 






292 


<223> OTHER INFORMATION: Description of Artificial 


Sequence : 


Synthetic 


293 


primer 






295 


<400> SEQUENCE: 12 






296 


aacatcgcgc gcaagtcg 




18 


299 


<210> SEQ ID NO: 13 






300 


<211> LENGTH: 18 






301 


<212> TYPE: DNA 






302 


<213> ORGANISM: Artificial Sequence 






304 


<220> FEATURE: 






305 


<223> OTHER INFORMATION: Description of Artificial 


Sequence : 


Synthetic 


306 


primer 






308 


<400> SEQUENCE: 13 






309 


aacatcgtgc gcaagtcg 




18 


312 


<210> SEQ ID NO: 14 






313 


<211> LENGTH: 18 






314 


<212> TYPE: DNA 






315 


<213> ORGANISM: Artificial Sequence 






317 


<220> FEATURE: 






318 


<223> OTHER INFORMATION: Description of Artificial 


Sequence : 


Synthetic 


319 


primer 






321 


<400> SEQUENCE: 14 






322 


atcacggtgc gggtcaag 




18 


325 


<210> SEQ ID NO: 15 






326 


<211> LENGTH: 18 






327 


<212> TYPE: DNA 






328 


<213> ORGANISM: Artificial Sequence 






330 


<220> FEATURE: 






331 


<223> OTHER INFORMATION: Description of Artificial 


Sequence : 


Synthetic 


.332 


primer 






334 


<400> SEQUENCE: 15 






335 


acgtcgttga tcgaccgg 




18 


338 


<210> SEQ ID NO: 16 






339 


<211> LENGTH: 18 






340 


<212> TYPE: DNA 






341 


<213> ORGANISM: Artificial Sequence 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 





' 110 


• N 


Pos 


14,15 




' 111 


' N 


Pos 


7, 


8 , 


9 




' 112 

> J> .L ^ J 


• N 


Pos 


7 , 


8 , 


9 




' 113 
t U. ^ -J i 




Pns 


7 ^ 


8 . 


9 




• ^ J. *x , 




JrvJO • 


7 , 


8 . 


9 




> X J. ^ 1 


■ N 

F 




7 , 


, 8 . 


9 




t X W 1 


F ^™ 


It wo • 


7 , 


, 8 . 


9 




■117 


• N 




7 




9 




'lift 

> X X O J 


rM 




7 


8 


9 


oeqff . 


• 1 1 Q 


r IN 


Po C 


7 


8 


9 


C/arrit • 


» 1 9 n 
. X ^ u 4 




Po G 
XT WO . 


7 


8 


9 


oetitr < 


•191 

> X ^ X J 


F 


It Wo • 


7 


8 


9 






IN 


Poc 


7, 


Q 




oeq# , 


.191 


; N 


Pos. 


7. 


'8, 


9 


Seq#: 


:124, 


r N 


Pos . 


7, 


8, 


9 


Seq#. 


: 125, 


; N 


Pos . 


7, 


8, 


9 


Seq#: 


:126, 


? N 


Pos . 


7. 


8, 


9 


Seq#: 


:127, 


? N 


Pes . 


1 , 


8, 


9 


Seq#: 


:128, 


r N 


Pos . 


7, 


8, 


9 


Seq#: 


:129, 


f N 


Pos. 


7, 


8, 


9 


Seq#: 


:130, 


F N 


Pos. 


7, 


-8, 


9 


Seq#: 


:131, 


? N 


Pos. 


7, 


8, 


9 


Seq# 


:132, 


r N 


Pes . 


7, 


8, 


9 


Seq#: 


:133, 


? N 


Pos . 


7, 


8, 


9 


Seq#: 


:134, 


? N 


Pos . 


7, 


8, 


9 


Seq#: 


:135, 


r N 


Pos. 


7, 


8, 


9 


Seq#; 


:136, 


F N 


Pos. 


7, 


8, 


9 


Seq#: 


:137, 


r N 


Pos. 


7, 


'8, 


9 


Seq#: 


:138, 


r N 


Pos . 


7, 


8, 


9 


Seq# 


:139, 


r N 


Pos. 


7, 


8, 


9 


Seq#: 


:142j 


r Xaa Pes. 


1, 


3 



3, 8, 9, 10, 11 



fiIe://C:\CRF4\OUTHOLD\VsrJ539560.htm 



1/31/2006 



Page 7 of 8 



VERIFICATION SUMMARY 
PATENT APPLICATION: US/10/539,560 

Input Set : A:\18765218.APP 

Output Set: N:\CRF4\01312006\J539560.raw 



DATE: 01/31/2006 
TIME: 16:14:20 



L:16 M 
L:1666 
L:1802 
L:1820 
Ii:1838 
L:1856 
L:1874 
L:1892 
L:1910 
L:1928 
L:1946 
L:1964 
L:1982 
L:2000 
L:2018 
L:2036 
L:2054 
L:2072 
L:2090 
L:2108 
L:2126 
L:2144 
L:2162 
L:2180 
L:2198 
L:2216 
L:2234 
L:2252 
L:2270 
L:2288 
L:2306 
L:2324 
L:2466 



271 C 
M:336 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 
M:341 



Current 

W : Inval 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 

W: (46 



Filing Date differs, Replaced 
id Amino Acid Number in Coding 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for SEQ ID#: 

or "Xaa" used, for. SEQ ID#: 
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Region, SEQ ID 
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